Probabilistic compensation of unreliable feature components for robust speech recognition

نویسندگان

  • Cyan L. Keung
  • Oscar C. Au
  • Chi H. Yim
  • Carrson C. Fung
چکیده

Missing feature theory is well studied in robust ASR context, many works have been done on additive noise of different colors. These are based mainly on classical spectral subtraction and marginal density techniques. This paper addresses the problem of temporal distortion of feature components, that is all about time domain instead of frequency one. No specific noise model and extract computation needed. We showed that the digit words recognition rate is above 95%, given test samples are clean with 10dB white noise added to middle 30% portion of speech along the time axis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...

متن کامل

Robust Speech Recognition: The case for restoring missing features

Speech recognition systems perform poorly in the presence of corrupting noise. Missing feature methods attempt to compensate for the noise by removing unreliable noise corrupted components of a spectrographic representation of the noisy speech and performing recognition with the remaining reliable components. Conventional classifiercompensation methods modify the recognition system to work with...

متن کامل

Missing feature theory and probabilistic estimation of clean speech components for robust speech recognition

In the framework of Hidden Markov Models (HMMs), this paper presents a new approach towards robust speech recognition in adverse conditions. The approach is based on statistical modeling of noise by Gaussian distributions and an estimation of idealized clean speech directly in the probabilistic domain using a statistical spectral subtraction method and missing feature compensation. The missing ...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000